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GENOTYPING CYTOCHROME EXPRESSION 



FIELD OF THE INVENTION 

The present invention is concerned with an assay and, 
in particular, with an assay for genotyping a 
polymorphism predictive of a phenotype associated with 
cytochrome expression, in this case CYP3A5 . 

BACKGROUND OF THE INVENTION 

The cytochrome P4 5 0 subfamily CYP3A represents one of 
the most important families of the P4 50 superfamily 
and plays a major role in the metabolism of an ever 
expanding list of therapeutic compounds (23, 24). This 
family comprises the most abundantly expressed P4 50s 
in human livers, and is responsible for the metabolism 
of over 50% of all clinically used drugs, including 
the dihydropyridines, cyclosporin, erythromycin and 
barbiturates (1) . Wide inter-individual variation in 
the metabolism of CYP3A substrates has been noted and 
is a factor in determining individual drug efficacy. 
Evidence also exists for the metabolism of an array of 
lipophilic environmental pollutants, including the 
activation of pro-carcinogens such as aflatoxin Bl by 
members of this subfamily (2) . 

Presently, four CYP3A cDNAs have been identified in 
humans, CYP3A3, CYP3A4 , CYP3A5 and CYP3A7 . It is 
believed that CYP3A3 represents an allelic variant of 
CYP3A4, whilst CYP3A4 and CYP3A7 are found only in 
human adult and fetal livers respectively (3) . Initial 
experiments suggested that a polymorphism existed in 
CYP3A4 (4) . However other studies, whilst confirming a 
wide range of inter- individual variation in CYP3A4 
expression have failed to confirm the original 
bimodality (5, 6). Overlapping substrate specificities 
between CYP3A5 and CYP3A5 have previously made it 



difficult to separate metabolism by these isoforms; 
consequently little phenotyping data have been 
produced to study variation in CYP3A5 activity in 
humans. However, there is evidence for the polymorphic 
expression of CYP3A5 . Use of both immunoblotting and 
Northern analysis have detected CYP3A5 expression in 
only 10-3 0% of human livers (7, 8, 9) . More recently, 
analysis of 3 0 human liver samples using 
immunoblotting found that only 3% showed no detectable 
CYP3A5, whilst a large number had trace amounts, 
suggesting that a polymorphism in this enzyme may be 
regulatory as opposed to structural (10) . 
Comparisons of the 5 1 flanking regions from the 
CYP3A4, 3A5 and 3A7 genes have identified putative 
binding sites for several transcriptional regulatory 
factors common to all isoforms (11, 12, 13). However, 
the molecular basis, if any, for this inter- individual 
variation in expression of the CYP3A sub- family 
members has so far remained unclear. Indeed it has 
been suggested that the host cellular environment may- 
be a greater determinant of inducibility than gene 
structure (14) . However, the determination of a major 
genetic component to variant expression and activity, 
linked to an easy screening method, would be extremely 
beneficial, not only in providing a predictor of 
individual response to drugs which are metabolised by 
these isoforms, but also in facilitating association 
studies between CYP3A and disease processes. 

The delineation of CYP3A4 and CYP3A5 metabolism has 
been shown to be possible using the sedative midazolam 
as a probe drug (15) . In this case two metabolites are 
formed, 1 -hydroxy midazolam (1-OHM) and 4 -hydroxy 
midazolam (4 -OHM) . Those samples containing a higher 
proportion of CYP3A5 compared to CYP3A4 have their 
metabolism driven towards the 1-OHM route and 
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therefore show a higher ratio of 1 -OHM/4 -OHM than 
those containing only CYP3A4 . The present inventors 
have now established that two polymorphisms, located 
in putative transcriptional regulatory regions, which 
caused increased CYP3A5 gene expression and metabolic 
activity are linked and have developed assays for 
their detection. These assays will allow prediction 
of inter- individual variability in response to drugs 
metabolised by this isoform, as well as facilitating 
disease association studies. 

BRIEF DESCRIPTION OF THE FIGURES 

The invention may be more clearly understood by the 
following example with reference to the accompanying 
drawings wherein 

Fig. la: is an illustration of the relationship 



between midazolam metabolic ratio and 
genotype for the linked A_ 147 G and T„ 475 G 
mutations in the 5' flanking region of the 
CYP3A5 gene. Midazolam metabolic ratio = 1- 
OHM/4-OHM, wt = samples with the wild type 
sequence in the 5 » flanking region as 
previously published (11) , Het = samples 
heterozygous for the linked polymorphisms, 
A_ 147 G and T_ 475 G. 



Fig. lb: is an illustration of the relationship 



between CYP3A5 mRNA expression and the 
linked A_ 147 G and T_ 475 G mutations in the 5' 
flanking region of CYP3A5 . Relative Ct 
difference = difference in threshold cycle 
between samples, as described in the methods 
section wt = samples with the wild type 
sequence in the 5' flanking region as 
previously published (11) Het = samples 



heterozygous for the linked polymorphisms, 
A_ 147 G and T_ 475 G . 

Fig* 2a: is a diagram of relative position of 

primers, and of the recognition site for the 
restriction enzyme Tai I, which is 
introduced into the PCR product utilising 
mismatched primer 3A5R1 when the wild-type 
"A" nucleotide is present at position -147, 
and is lost when the mutant "G" nucleotide 
is present. 

Fig. 2b: is a diagrammatical representation of 

expected restriction fragments for each 
possible genotype for the A_ 147 G variant, 
i.e. homozygous wild-type, heterozygous and 
homozygous mutant. 



Fig. 2c: is an illustration of a 1.5% agarose gel of 
Tai I restriction digest of 3A5F2/3A5R1 PCR 
product for detection of the A„ 147 G variant. 

Lane 1. 100 bp ladder. Lanes 2 & 7. 
Reference undigested PCR products . Lane 3 . 
Sample homozygous for the wild- type "A" 
nucleotide at position -147. Lanes 10, 11, 
16. Samples heterozygous for the A_ 147 G 
variant . 



Fig. 3a: is a diagram of relative position of 

primers, and of the recognition sites for 
the restriction enzyme Alu I. The forward 
recognition site is introduced into the PCR 
product utilising mismatched primer 3A5F2 
when the wild-type W T" nucleotide is present 
at position -475, and is lost when the 
variant "G" nucleotide is present. 



JAB 1462 



Fig. 3b: is a diagrammatical representation of 

expected restriction fragments for each 
possible genotype for the T_ 475 G variant, 
i.e. homozygous wild-type, heterozygous and 
homozygous mutant . 

Fig. 3c: is an illustration of a 12.5% 

polyacryl amide ExcelGel of Alu I restriction 
digest of the 3A5F2/3A5R1 PCR product for 
detection of the T„ 475 G mutation. Lane 1.100 
bp ladder. Lanes 2, 5, 6, 7 & 8. Samples 
homozygous for the wild-type AT= nucleotide 
at position -147. Lanes 3, 4, 9. Samples 
heterozygous for the T. 475 G mutation. 
Fragment X - additional digestion product 
resulting from re-amplification of original 
template by primers 3A51/3A52. 

Fig. 4a: is a comparison of 1 -OHM/4 -OHM metabolic 
ratios between samples with the linked 
mutations (HET group) and those wild-type 
for the mutations at positions -147 and -4 75 
(WT group) . Mean and quartiles are shown for 
each group, as is overall mean for the 
combined groups (central line) . 

Fig. 4b: is a comparison of CYP3A5 expression (In 

transformed) between samples with the linked 
mutations (HET group) and those wild- type 
for the mutations at positions -147 and -475 
(WT group) . Mean and quartiles are shown for 
each group, as is overall mean for the 
combined groups (central line) . 

Fig. 5: is a Western blot analysis of CYP3A5 protein 
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expression in liver samples. A Western blot 
of microsomes prepared from liver samples 
and probed with a CYP3A5 specific antibody. 
Liver samples containing the linked 
polymorphisms at -14 7 and -4 75 (wt group) 
are marked * (sizes indicated in kDa from 
Wide Range Colour Marker (signs) ) . 

Fig. 6: is a list of oligonucleotide mismatch 
primers used in accordance with the 
invention, where the underlined nucleotide 
indicates the sequence mismatch. 

Fig. 7: is an illustration of the nucleotide 

sequence of the 5 ' flanking region relative 
to the DNA sequence encoding CYP3A5 . 

Fig. 8: is an illustration of the results obtained 

from an Electrophoretic mobility shift assay 
(EMSA) of A_ 147 G oligonucleotide. EMSA was 
carried out as described in materials and 
methods. Lane 1 : A-147G oligonucleotide 
without HeLa nuclear extract; lanes 2-8 : in 
the presence of HeLa nuclear extract; lanes 
3 and 4 : in the presence of 50 - 100 fold 
molar excess of unlabeled A-147G 
oligonucleotide; lanes 5 and 6; in the 
presence of 50 - 100 fold molar excess of 
unlabeled wild type oligonucleotide; lanes 7 
and 8 : in the presence of 1 and 2 
microlitres of anti-Spl antibody. 



Fig . 9 - 9d : 



are illustrations of the results obtained 
from the >find patterns^ program of the 
GCG sequence analysis package. 
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DETAILED DESCRIPTION OF THE INVENTION 

Therefore, according to a first aspect of the present 
invention there is provided a method of identifying 
subjects having a high or low drug metabolising 
phenotype associated with cytochrome CYP3A5 
expression, which method comprises screening for the 
presence or absence in the genome of a subject a 
polymorphic variant in a transcription regulatory 
region , such as, a promoter or enhancer adjacent the 
region encoding CYP3A5 . Preferably, the method 
involves screening for a variant in a recognition site 
for a transcription factor of said regulatory region, 
and even more preferably in an activator protein- 3 
motif or a basic transcription element. Even more 
preferably, the method involves screening for a 
variant at any one of positions -475 or -147 of the 
DNA of the 5 1 flanking region adjacent to the region 
encoding CYP3A5 the sequence of which flanking region 
is illustrated in Figure 7 and preferably, for both 
the variants at positions -475 and -147. 

In one embodiment of the method of the invention 
genomic DNA is amplified, preferably by the polymerase 
chain reaction using oligonucleotide molecules capable 
of hybridising selectively to the wild type sequence 
or the variant sequences, such that generation of 
amplified DNA from said molecules will indicate 
whether said wild type or mutation is present. In 
this method PCR primers hybridise either to the 
mutated or wild type sequence, but not both. 
Amplification of the DNA of the respective mutation or 
wild type genotype using the respective primers will 
provide an indication of the presence of the wild type 
or mutated nucleotide mutations. 

A further method of the invention advantageously 
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utilises oligonucleotide molecules as primers which, 
in addition to hybridising to the site of interest, 
are capable of introducing a restriction site which is 
absent in either the wild type sequence or polymorphic 
variants . Therefore, according to a further aspect of 
the invention, there is provided a method of 
identifying subjects having a high or low drug 
metabolising phenotype associated with CYP 3A5 
expression, which method comprises 1) amplifying 
genomic DNA from a subject using oligonucleotide 
molecules capable of hybridising to the wild type 
sequence and/or to the polymorphic variant sequence at 
a location being analysed, which molecules are such 
that they can introduce a restriction site at said 
location which is not present in the wild type or 
variant sequences, and 2) subjecting amplified DNA 
from step 1 to a restriction enzyme which cleaves the 
DNA at said restriction site to provide a restriction 
digest indicative of the presence or absence of said 
variant . 

The method preferably comprises amplifying DNA in a 
recognition site for a transcription factor of said 
regulatory region and preferably in an activator 
protein-3 motif (AP-3) and/or basic transcription 
element (BTE) . Preferably, the method comprises 
amplifying DNA spanning any of position -475 or -147, 
of the regulatory region of CYP 3A5 , the sequence of 
which is illustrated in Figure 7. 

The polymorphisms at the positions identified in each 
of the methods according to the invention comprise T_ 
475 6 G and A_ 147 6 G . As presented in the Examples 
below, the molecule used to detect the variation at A_ 
147 6 G is capable of introducing a restriction site 
for the enzyme Tai I only when the wild type A 




nucleotide is present at position -147. 
Alternatively, the molecule used to detect the T_ 475 6 
G nucleotide variation is capable of introducing a 
restriction site for the enzyme Alu I only when the 
wild type T nucleotide is present at position -475. 

In this embodiment an example of suitable primers is 
any of 3A5F1 GGGTCTGTCTGGCTGCGC 
and 3A5F2 (GGGGTCTGTCTGGCTGAGC) 
and 3A5R1 (TTTATGTGCTGGAGAAGGACG) . 

Using oligonucleotide mismatch primer 3A5R1 creates a 
Tai I recognition site only when the wild type A 
nucleotide is present at position -147. Digestion of 
the 3 6 9bp product with Tai I yields fragments of 34 9 
and 2 0bp for the wild type sequence, whilst the 
product remains undigested if a mutant, such as the G 
nucleotide, is present (Figure 2). Similarly, for the 
detection of the T_ 475 G mutation a second 
oligonucleotide mismatch primer 3AF2 may be used. 
This primer introduces a recognition site for the 
restriction enzyme Alu I when the wild type T is 
present at position -475, digesting the product to 
yield fragments of 318, 33 and 18bp. This site is 
lost when the mutant G nucleotide is present, yielding 
digestion products of 336 and 33bp (Figure 3) . 

Known techniques for the scoring of single nucleotide 
polymorphisms (see review by Schafer, A. J. and 
Hawkins, J. R. in Nature Biotechnology, Vol 16, pp33- 
39 (1998) include mass spectrometry, particularly 
matrix-assisted laser desorption/ionization time-of- 
f light mass spectrometry (MALDI -TOF-MS , se Roskey, M. 
T. et.al., 1996, PNASUSA, 93: 4724-4729), single 
nucleotide primer extension (Shumaker, J. M. et.al., 
1996, Hum. Mutat . , 7: 346-354/ Pastinen, T. et.al., 
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1997, Genome Res., 7: 606-614) and DNA chips or 
microarrays (Underhill, P. A. et.al. , 1996, PNAS USA, 
93: 196-200; Gilles, P. N. et.al. Nat. Biotech., 1999, 
17: 365-370). The use of DNA chips or microarrays 
could enable simultaneous genotyping at many different 
polymorphic loci in a single individual or the 
simultaneous genotyping of a single polymorphic locus 
in multiple individuals. 

In addition to the above, SNPs are commonly scored 
using PCR-SSCP based techniques, such as PCR-SSP using 
allele- specif ic primers (described by Bunce, 1995) . 
If the SNP results in the abolition or creation of a 
restriction site then genotyping can be carried out by 
performing PCR using non-allele specific primers 
spanning the polymorphic site and digesting the 
resultant PCR product using the appropriate 
restriction enzyme. The known techniques for scoring 
polymorphisms are of general applicability and it 
would therefore be readily apparent to persons skilled 
in the art that the known techniques could be adapted 
for the scoring of single nucleotide polymorphisms in 
the the regulatory region of CYP 3A5 . 

As would be readily apparent to those skilled in the 
art, genotyping is generally carried out on genomic 
DNA prepared from a suitable tissue sample obtained 
from the subject under test. Most preferably, genomic 
DNA is prepared from a blood sample, according to 
standard procedures which are well known in the art 
Also provided by the present invention is an 
oligonucleotide of at least 10 contiguous nucleotides 
to detect polymorphic variants in a 5 1 regulatory 
region adjacent the sequence encoding cytochrome 
CYP3A5 associated with a high or low drug metabolising 
phenotype. The oligonucleotide is capable of 
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hybridising to a region incorporating either a mutated 
or wild type nucleotide at position -475 or -147 of 
said flanking region, such that amplification of said 
positions will or will not proceed from said primer 
according to whether or not a polymorphic variant 
occurs at any of said positions. 

The oligonucleotide molecules of the invention 
are preferably from 10 to 50 nucleotides in length, 
even more preferably from 2 0-3 0 nucleotides in length, 
and may be DNA, RNA or a synthetic nucleic acid, and 
may be chemically or biochemically modified or may 
contain non-natural or derivatized nucleotide bases, 
as will be readily appreciated by those skilled in the 
art. Possible modifications include, for example, the 
addition of isotopic or non-isotopic labels, 
substitution of one or more of the naturally occurring 
nucleotide bases with an analog, internucleotide 
modifications such as uncharged linkages (e.g. methyl 
phosphonates, phosphoamidates , carbamates, etc.) or 
charged linkages (e.g. phosphorothioates , 
phosphorodithioates, etc.). Also included are 
synthetic molecules that mimic polynucleotides in 
their ability to bind to a designated sequence to form 
a stable hybrid. Such molecules are known in the art 
and include, for example, so-called peptide nucleic 
acids (PNAs) in which peptide linkages substitute for 
phosphate linkages in the backbone of the molecule. 
An oligonucleotide molecule according to the invention 
may be produced according to techniques well known in 
the art, such as by chemical synthesis or recombinant 
means . 

The oligonucleotide molecules of the invention 
may be double stranded or single stranded but are 
preferably single stranded, in which case they may 
correspond to the sense strand or the ant i sense strand 
of the 5 1 regulatory region of CYP3A5 . The 
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oligonucleotides may advantageously be used as probes 
or as primers to initiate DNA synthesis/DNA 
amplification. They may also be used in diagnostic 
kits or the like for detecting the presence of one or 
more variants alleles of the regulatory region of 
CYP3A5 . These tests generally comprise contacting the 
probe with a sample of test nucleic acid (usually 
genomic DNA) under hybridising conditions and 
detecting for the presence of any duplex or triplex 
formation between the probe and complementary nucleic 
acid in the sample. The probes may be anchored to a 
solid support to facilitate their use in the detection 
of these variants. Preferably, they are present on an 
array so that multiple probes can simultaneously 
hybridize to a single sample of target nucleic acid. 
The probes can be spotted onto the array or 
synthesised in situ on the array. (See Lockhart et 
al . , Nature Biotechnology, vol. 14, December 1996 
Expression monitoring by hybridisation to high 
density oligonucleotide arrays" . A single array can 
contain more than 100, 500 or even 1,000 different 
probes in discrete locations. Preferably, the 
oligonucleotides comprise any of the primers 3A5F1, 
3A5F2 and 3A5R1 as defined herein. 

Also provided is a kit to perform the method according 
to the invention. Preferably, the kit will comprise 
an oligonucleotide as described herein and even more 
preferably the kit will further comprise one or more 
restriction enzymes capable of distinguishing between 
wild-type or polymorphic variants as defined herein. 
Preferably, the restriction enzyme comprises Tai I or 
Alu I. 

According to a further aspect of the invention there 
is also provided a method of identifying toxic or 
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mutagenic effects of a test compound, such as, a drug, 
toxin or procarcinogen metabolised by CYP3A5 the 
method comprising contacting each of a cell having a 
high drug metabolising phenotype and a cell having a 
low metabolising phenotype associated with cytochrome 
CYP3A5 expression, with said test compound and 
identifying the effects of said compound on each of 
said high or low drug metabolising phenotype cells or 
other cells sensitive to said compound. An even 
further aspect comprises a method of diagnosing 
susceptibility of an individual to a disease 
associated with environmental toxins or procarcinogens 
metabolised by CYP3A5 , the method comprising the steps 
of 1) providing a sample containing DNA, and 2) 
identifying the presence or absence of a mutation in a 
transcription regulatory region adjacent to the DNA 
sequence encoding CYP3A5 using a reagent capable of 
distinguishing the presence or absence of a nucleotide 
in said regulatory site. According to this aspect of 
the invention, the mutation occurs in a recognition 
site for a transcription factor of said regulatory 
region and preferably in an activator protein-3 motif 
(AP-3) and/or a basic transcription element (BTE) . 
Preferably, the mutation occurs at any of positions - 
4 75 and -14 7 of the regulatory region and even more 
preferably at both positions where the mutation may be 
T_ 475 G or A_ 147 G. 

Advantageously, it is also envisaged that the 
regulatory region of the 5 ! flanking region can be 
used to identify or purify transcription factors which 
bind to the 5 1 region including the respective 
polymorphic variants. Thus, according to a further 
aspect of the invention, there is provided a method of 
identifying transcription factors capable of binding 
to a DNA fragment from a transcription regulatory 
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region adjacent DNA encoding cytochrome CYP3A5, said 
method comprising contacting said DNA fragment 
including said transcription regulatory region with 
potential transcription factors and identifying any 
transcription factor complexed to said DNA fragments. 

Using the transcription regulatory fragment it is 
possible to identify compounds or agents which exhibit 
or exert their effect on the transcription regulatory 
region of CYP3A5 . Thus, there is provided according 
to this aspect of the invention a method of 
identifying compounds acting on a transcription 
regulatory region adjacent to a DNA sequence encoding 
CYP3A5 , the method comprising transforming a cell with 
a DNA construct comprising the sequence of said 
regulatory region, and which regulatory region is 
operably linked to a sequence encoding a reporter 
molecule, contacting said cell with a test compound 
and identifying any expression of said reporter 
molecule. Preferably, said cell is expressing CYP3A5 
or is showing CYP3A5 activity. 

Also provided by the invention is a method of 
purification of transcription factors from a sample 
which are capable of binding to DNA from a 
transcription regulatory region adjacent a DNA 
sequence encoding cytochrome CYP3A5 , the method 
comprising contacting a DNA fragment including said 
transcriptional regulatory region with a mixture of 
transcription factors and identifying any complexes of 
said transcription factors and said fragment. 

An even further aspect of the invention comprises a 
method of providing a measure of activity of a 
transcription regulatory region adjacent to DNA 
encoding cytochrome CYP3A5 or alternatively a method 
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of identifying a mutation which alters the activity of 
the transcription regulatory region the method 
comprising providing a DNA construct having a sequence 
encoding a reporter molecule operably linked to a DNA 
fragment comprising said regulatory region, and 
introducing said construct into a cell and monitoring 
for the level of expression of said reporter molecule. 

When the method is used to identify a variant which 
alters the activity of the transcription regulatory 
control region, the method may include the further 
step of comparing the levels of expression of a wild 
type and a polymorphic regulatory region as described 
herein. 

According to each of the aspects of the invention, the 
regulatory region includes a polymorphic variation, 
preferably in a recognition site for a transcription 
factor of said regulatory region, and preferably in an 
activator protein-3 motif (AP-3) and/or a basic 
transcription element (BTE) . In a preferred 
embodiment the variant occurs at position -475 or -147 
of the region flanking the sequence encoding CYP3A5 , 
and which region is illustrated in Figure 7 . 
Preferably, both the variants are present. 
The methods of the present invention will be 
particularly valuable to establish, prior to treatment 
with a drug, whether the drug will be effectively 
metabolised by the patient. 

Experimental Procedures 

Liver microsome pr&paration 

Human liver samples were obtained from kidney 
transplant donors, and flash- frozen immediately on 
removal. Human liver microsomes were prepared 
according to previously described protocols (21), and 
protein content was determined by the method of Lowry 
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as modified by Miller (22) . 
Midazolam hydroxylase assay 

The rates of midazolam overall metabolism and of the 
formation of 1- and 4-OH-midazolarn were determined as 
follows. Each incubation vessel contained an aliquot 
of the microsomal suspension (containing 1 mg of 
microsomal protein) in 1.15 % KC1 - 0.01 M phosphate 
buffer pH 7.4; 10 Fl of a stock solution of 6 mM 
midazolam dissolved in DMSO to reach a final midazolam 
concentration of 60 FM; 500 Fl of a co- factor mixture 
containing 0.5 mg of glucose- 6 -phosphate, 0.5 mg of 
MgCl 2 . 6H 2 0, 0.5 units of glucose-6-phosphate 
dehydrogenase dissolved in 0 . 5 M Na-K-phosphate 
buffer, pH 7.4 and a 1.15 % KC1 - 0.01 M phosphate 
buffer pH 7,4 to bring the incubation volume to 0,9 
ml. After a pre- incubation for 5 min at 3 7EC, the 
incubations were started by adding 10 0 Fl of a 
solution of 1.25 mg/ml NADP to reach a final 
concentration of 0.125 mg/ml. Tubes were continuously 
shaken at 100 oscillations/min in an Heto shaking 
waterbath. Blank incubates with boiled microsomes 
were incubated under identical conditions as the 
control incubates. The incubations were stopped after 
30 min by immersing the tubes in dry ice. Samples 
were stored at <. -18°C until analysis. The incubation 
samples were analysed for unchanged midazolam and for 
its metabolites 1 1 - and 4 -hydroxymidazolam by HPLC 
with UV-detection. 

HPLC determination of midazolam metabolites 
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The 1-ml samples of midazolam were thawed and diluted 
with 1 ml DMSO. Samples were sonicated for 10 min, 
centrifuged and an aliquot of the supernatant was 
injected directly onto the HPLC- column. The HPLC 
apparatus consisted of a Waters 60 0 MS pump. The 
samples were injected automatically, using a WISP 717 
plus automatic injector. Stainless steel columns (30 
cm x 4.6 mm i.d.) Were packed with Kromasill 18 (5 Fm) 
bound phase by a balanced density slurry procedure 
(Haskel DSTV 122 -C pump, 10 7 Pa) . UV-detection at 
23 0 nm was performed using a Waters 996 Diode Array 
Detector. Elution at 1-ml /min started with a short 
gradient from 100% 0.1 M ammonium acetate, pH 7.0 
(solvent system A) to 50% of solvent system A and 50% 
of solvent system B containing 1M ammonium acetate pH 
7.0, methanol and acetonitrile (10/45/45), over a 1- 
min period, followed by a second gradient to 10 0% 
solvent system B in 15 min. This solvent composition 
was held for 2 min before equilibration with the 
starting conditions. The identity of the metabolites 
of midazolam was confirmed using mass spectroscopy. 
The conversion of UV-peak areas into ng was performed 
by a Millennium 2 02 0 CDS system on a calibration curve 
of midazolam. This calibration curve was made up after 
injection of known amounts of the drug (0, 1059, 2117, 
3176 and 5 02 8 ng) and linear (weighted by l/x) 
regression analysis of the corresponding UV-peak 
areas. The equation of the calibration curve was ng = 
0. 000333 x area (r 2 = 0.9997, n = 5) . The metabolic 
activity was expressed as pmol metabolite formed/min 
mg protein, and a metabolic ratio was determined for 
each sample according to the ratio of 10HM/40HM in 
each sample. 

Genomic DNA preparation 

DNA was isolated from frozen liver samples using a 




QIAmp Tissue Kit (QIAGEN) in accordance with the 
manufacturers instructions. 



RNA preparation 

RNA was isolated from the liver samples using a QIAGEN 
RNAeasy Midi Kit (QIAGEN) , according to manufacturers 
instructions. Twenty fxg of RNA was treated with RNAse- 
free DNAse I (Boehringer Mannheim) , for 3 0 min at 37°C 
in 20 mM Tris-HCl, pH 8.0, 100 mM MgCl 2 . Samples were 
phenol/chloroform extracted, precipitated and 
resuspended in 3 0 ^1 of TE buffer. Two and a half fig 
of the treated sample was reverse transcribed for 5 0 
minutes at 42 °C in 1 x first strand buffer, 0.01M DTT 
and 0.5M dNTPs using 0.5 /ig of oligo(dt) random 
primers and 2 00 units Superscript II Reverse 
Transcriptase (GibcoBRL) for use on the ABI Prism 770 0 
Sequence Detection System (SDS) . 

Sequencing of the CYP3A5 5' flanking region 

A 1343 bp 5 1 flanking region of CYP3A5 was PCR 
amplified from genomic DNA isolated from liver 
samples, using primers 3A51 (5'- 
GGAAGCAACCTACATGTCCATC ) and 3 A5 2 ( 5 1 - 

ATCGCCACTTGCCTTCTTC) based on the published sequence 
of Jounaidi et a.1 . (11). PCR conditions were 1 cycle 
of 95°C for 1 min, 30 cycles of 95°C for 1 min, 57°C 
for 30 sec, 72 °C for 2.5 min, and 1 cycle of 72 °C for 
10 min. PCR products were purified using a QIAquick 
PCR Purification Kit (QIAGEN) , sequencing primers were 
designed (Table 1) , and used to directly sequence the 
PCR product on both sense and ant i sense strands by 
cycle sequencing using the ABI BigDye Terminator cycle 
sequencing kit (Perkin Elmer) . Sequencing reactions 
were analysed on an ABI 3 77 automated sequencer. 
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Contig sequences were aligned and compared using the 
Sequence Editor version 1.0.3 software packages 
(Perkin Elmer) and manually edited for identification 
of heterozygote positions. 

PCR detection assays far the A. 147 G and T_ 475 G mutations 
All PCR assays were performed utilising a 1 in 100 
dilution of the original 3A51/3A52 PCR product as 
template, under the following conditions: 1 cycle of 
95°C for 1 min, 30 cycles of 95°C for 1 min, 55°C for 
30 sec, 72 °C for 1 min, and 1 final cycle of 72 °C for 
10 min. All products were sequenced to confirm the 
identity of the product as CYP3A5 . Oligonucleotide 
mismatched primers utilised in the assays were: 3A5F1 
(5 1 - GGGTCTGTCTGGCTGCGC ) , 3A5F2 (5 1 - 
GGGGTCTGTCTGGCTGAGC) , and 3A5R1 (5'~ 

TTATGTGCTGGAGAAGGACG) , where positions of mismatches 
are underlined. 

For the A^ 147 G mutation, PCR was performed using primer 
pair 3A5F2 and 3A5R1 . Twenty fil of PCR product was 
digested for a minimum of 3 hours at 65 °C using 15 
units of Tai I, and the restriction fragments 
visualised by ethidium bromide staining after 
electrophoresis on a 1.5% agarose gel. 

For the T_ 475 G mutation, PCR was performed using primer 
pair 3A5F2 and 3A5R1 as described above. Twenty fxl of 
PCR product was digested with 15 units of Alu I for a 
minimum of 3 hours, and restriction fragments were 
separated by electrophoresis on a 12.5% ExcelGel on a 
Pharmacia Multiphor Electrophoresis system 
(Pharmacia) . Fragments were visualised by silver 
staining in a Hoeffer Automatic Gel Stainder 
(Pharmacia) . 
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To detect the presence of mutations on the same 
chromosome, PCR was performed using primers 3A5F1 and 
3A5R1. Twenty Fl of PCR product was digested for a 
minimum of 3 hours at 65 °C using 15 units of Mvn I, 
and the resulting restriction fragments were 
visualised by ethidium bromide staining after 
electrophoresis on a 1.5% agarose gel. 

Relative quantification and comparison of CYP3A5 RNA 

Relative levels of CYP3A5 mRNA were determined by real 
time PCR using the ABI 7700 SDS (Perkin Elmer) . 
Optimal primers and probes for the detection of CYP3A5 
were designed using the PrimerExpress program, and 
subsequently checked to ensure specificity for CYP3A5 , 

Primers utilised for the quantification PCR were: 
forward - 5 ' -AAGTGGCGATGGACCTCATC-3 ( ; reverse - 5'- 
GAGGAGCACCAGGCTGACA-3 ' . The TaqMan probe was labelled 
with the 5' reporter dye 6-carboxy-f louresine (FAM) , 
and had the sequence 5 1 -CAAATTTGGCGGTGGAAACCTGGC-3 1 . 
Optimal primer/probe ratios and concentrations were 
determined and the experiments run according to 
standard protocols for the ABI 7700 Standard Detection 
System. CYP3A5 mRNA expression for all samples was 
normalised against the expression of P-actin mRNA. The 
threshold cycle (Ct) is the PCR cycle number where the 
ABI 7700 begins to detect an increase in fluorescent 
signal associated with the linear amplification of PCR 
product. The Ct value is dependent on the initial 
amount of template copy. Quantities of CYP3A5 in each 
sample were determined by averaging the Ct from 3 
separate PCR reactions of each sample. Relative 
differences in Ct between samples were calculated by 
subtracting the Ct of each sample from the highest Ct 
within the samples (lowest expression) . Since the 
amount of PCR product doubles with every cycle in the 
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linear range of a PCR the differences in Ct were 
converted into estimated differences of mRNA quantity 
between the samples by calculating 2 &t , where <%2t is 
the difference in cycle threshold between two samples. 

Negative controls were performed on each run to ensure 
that no signals were due to DNA contamination. Control 
samples consisted of RNA samples which had been 
treated in exactly the same manner as for the 
quantitative PGR, but without the addition of the 
reverse transcriptase . 

Statistical Analysis 

Statistical analysis was performed on the JMP 
Statistical program version 3.2.2 (SAS Institute 
Inc.). Metabolic ratio and CYP3A5 mRNA expression data 
were checked to ensure that they conformed to a normal 
distribution. CYP3A5 mRNA expression data did not 
conform to a normal distribution and were In- 
transformed, afterwhich the data was normally 
distributed. Metabolic ratios and expression levels 
were compared between groups using a t-Test. 

Western Blot Analysis 

Forty micrograms of microsomal protein prepared from 
each liver were solubilised in an equal volume of 
Laemmli sample buffer (Biorad) by four cycles of 
freezing and boiling for 10 minutes. Samples were 
loaded onto pre-cast 10% SDS-PAGE Ready Gels (Biorad) 
and electrophoresed for 1 hour at 18 0 V. Separated 
proteins were transferred onto Hybond-P membranes 
(Amersham) using a Trans -blot SD apparatus (Biorad) . 
Membranes were blocked by an overnight incubation at 
4°C in lx PBS containing 5% (w:v) nonfat milk and 0.1% 
(v:v) Tween. Membranes were incubated at ambient 
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temperature for 1 hour in a 1:3000 dilution of 
specific human CYP3A5 antibody (Gentest) in IX PBS, 
2.5% nonfat milk, then rinsed four times in lx PBS, 
2.5% (w:v) nonfat milk, 0.1% (v: v) Tween. Membranes 
were incubated at ambient temperature for 1 hour in a 
1:5000 dilution of Anti-Rabbit IgG peroxidase 
conjugate (Sigma) in lx PBS, 2.5% (w:v) nonfat milk, 
and rinsed as previously. The membranes were 
developed using the ECL Plus Western Blotting 
Detection System (Amersham) according to 
manufacturer's instructions, and visualised by 
autoradiography using Kodak X-Omat film (sigma) . 

Example 1 

Midazolam phenotyping 

A panel of 3 9 liver samples was phenotyped for CYP3A5 
activity, using the metabolism of midazolam to its 1- 
OH metabolite as a marker of activity. Human liver 
microsomal samples containing CYP3A5 in addition to 
CYP3A4 exhibit a significantly greater ratio of 1-OHM 
to 4 -OHM compared with samples containing only CYP3A4 . 
1-OHM/ 4 -OHM ratios between 5 and 9 were observed for 
microsomes containing both CYP3A4 and CYP3A5 . Samples 
containing only CYP3A4 showed 1 -OHM/4 -OHM ratios < 4 
(15) . Analysis of the CYP3A5 phenotypes in our data 
set showed a clear bimodal distribution, with 6 
samples (15%) having metabolic ratios greater then 5, 
and the remaining samples having metabolic ratios 
lying between 1.5 and 3.5 (see Fig. la). Of the 39 
liver samples from which microsomes were prepared for 
metabolic analysis, sufficient tissue was available 
for full DNA and RNA analysis for 26, which included 6 
samples lying in the higher metabolic ratio range. In 
addition to these 2 6 samples microsomes for protein 
analysis were available for a further 3 samples, all 
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of which had metabolic ratios of <4 . 

Analysis of CYP3A5 g&ne 5 9 flanking region 

The 5' flanking region of CYP3A5 was PCR-amplif ied 
from genomic DNA of all 26 samples and sequenced in 
full, as shown in Figure 7. Alignment showed that the 
region was well conserved. Only a small number of 
inter-individual variations were identified in 
addition to a few variations from the published 
sequence (Table 2.) . All variants detected were 
heterozygous, and all samples heterozygous for the 
more frequent A_ 147 G mutation were also heterozygous 
for the T„ 475 G mutation, suggesting that the two 
mutations were linked. These two mutations fall 
within two separate putative regulatory elements, a 
basic transcription element (BTE : A_ 147 G) and an 
activator protein-3 motif (AP-3: T„ 475 G) . None of the 
remaining variants fell within putative regulatory 
domains . 

PCR assays were developed to confirm the presence of 
the A_ 147 G and T_ 475 mutations individually, and to 
ascertain if the two mutations were on the same, or on 
separate chromosomes . The PCR assay for the A_ 147 G 
mutation was based on the creation of a recognition 
site for the restriction enzyme Tai I by utilising an 
oligonucleotide mismatch primer (3A5R1) . This primer 
introduces a Tai I recognition site only when the 
wild-type "A" nucleotide is present at position -147. 
Digestion of the 3 69bp product with Tai I yields 
fragments of 349 and 20bp for the wild-type sequence, 
whilst the product remains undigested if the mutant 
"G" nucleotide is present (Fig. 2) . Similarly, for the 
detection of the T_ 475 G mutation a second 

oligonucleotide mismatch primer was used (3A5F2) . This 
primer introduces a recognition site for the 
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restriction enzyme Alu I when the wild-type T 
nucleotide is present at position -4 75, digesting the 
product to yield fragments of 318, 3 3 and 18 bp. This 
site is lost when the mutant G nucleotide is present, 
yielding digestion products of 336 and 33 bp (Fig. 3) . 

To determine if the mutations were present on the same 
chromosome a PCR assay was developed utilising two 
oligonucleotide mismatch primers (3A5F1 and 3A5R1) , 
both primers introducing recognition sites for the 
restriction enzyme Mvn I when the mutant nucleotides 
are present at positions -147 and -475. If the 
mutations are present on the different chromosomes 
then the original 3 69 bp product is digested to yield 
products of 349/350 bp and 20/19 bp (inseperable by 
gel electrophoresis) , whilst if present on the same 
chromosome the fragment is digested to yield products 
of 33 0 and 2 0/19 bp (data not shown) . In addition to 
confirming the individual genotypes of the samples as 
determined by sequencing the two mutations were, in 
all cases, linked on the chromosome (data not shown) . 

Relationship between CYP3A5 allelic variants, CYP3A5 
mediated metabolism, CYP3A5 mRNA and protein 
expression 

Samples were grouped according to genotype: "Wild- 
type" or "mutant" (containing the linked 
polymorphisms) , and the 1-0HM/4-0HM metabolic ratios 
(mr) were compared between the groups (Fig. 4a) . With 
the exception of one outlier (liver sample number, mr 
= 2.08), all individuals carrying the linked mutations 
had metabolic ratios > 5.0, whilst the wild type group 
all possessed metabolic ratios of < 3.5. The mean 
metabolic ratios for the mutant group were 
significantly higher than those from the wild- type 
group (6.0 ± 2.0 versus 2.7 ± 0.42, mean + standard 
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deviation; p < 0.001). 

Quantitative PCR was used to ascertain if the 
mutations in the 5' flanking region were affecting 
gene expression. Whilst mRNA levels showed greater 
variation than the metabolic data, a degree of 
bimodality was observed (Fig. lb). The mutant group 
had CYP3A5 mRNA levels skewed towards the higher end 
of the expression range, showing significantly higher 
levels of CYP3A5 mRNA than the wild type group (mean 
InCt = 4.03, standard deviation = 0.97, against mean 
InCt = 2.06, standard deviation =1.2, p < 0.006) 
((Fig. 4b). In this case the outlier (presenting with 
the mutant genotype, but wild type metabolic ratio) 
also fell within the lower range of expression (InCt = 
2.9). 

The level of CYP3A5 protein expression levels was 
determined for 29 liver samples by Western blot 
analysis using a CYP3A5 specific antibody. A single 
band of 52 kDa corresponding to CYP3A5 was clearly 
apparent in some samples. With the exception of the 
single outlier with the high expression genotype 
(mutant) and low metabolic ratio phenotype (wild- 
type) , all samples which possessed the high expression 
genotype, a high metabolic ratio and high RMA 
expression level clearly show high levels of CYP3A5 
expression when compared to those samples with the low 
expression genotype and phenotype (Fig. 5) . The 
single outlier with the high expression genotype, but 
low expression phenotype showed levels of CYP3A5 
expression similar to those in the low expression 
genotype group. Longer exposure of the Western blot 
indicated that a very low level of CYP3A5 expression 
was apparent in most samples (data not shown) . 
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The 5' flanking sequences of CYP3A5 obtained in this 
study are virtually identical to those published by 
Jounaidi et al . (11)/ and show little inter-individual 
variation in sequence. Interestingly, Jounaidi et al . 
sequenced two human genomic clones, one of which 
contained the two linked mutations described in detail 
in this report. This would suggest that one clone was 
derived from an individual in the low expression 
group, and one from an individual in the high 
expression/metabolism group. 

Previous studies had suggested that CYP3A5 was 
expressed in 10-30% of livers (7, 8, 9) whilst another 
study has stated that some expression is constitutive 
in all samples (10) . The present study supports the 
findings that some CYP3A5 expression is constitutive, 
with some metabolic activity and mRNA being detected 
in all livers studied, although CYP3A5 protein was not 
convincingly demonstrated in all samples using the 
procedures required. We detected enhanced RNA and 
protein expression in 23% of the samples for which 
tissue was available (6 out of 26) , which is similar 
to the fraction of liver showing expression in 
previous studies. This supports the finding of Boobis 
et al. (10) that some show low level expression is 
constitutive in all liver samples although this can 
only be detected using more sensitive detection 
techniques (such as PCR, and not by Western or 
Northern blot analysis) . 

Whilst both polymorphisms detected lie within putative 
transcriptional regulatory elements, we suspect that 
the variant within the BTE is more likely to be 
responsible for altered expression since it has been 
reported that a BTE flanking the TATA box accounts for 
the constitutive expression of CYP1A1 , and a similar 
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region has been found in several other CYP genes 
including CYP2B1, CYP2B2 , CYP2E1 (16) CYP3A4 (13) and 
CYP3A7 (12) . In the case of CYP3A4 gene this element 
has been shown to bind nuclear extracts (13) and a 
basic transcription element binding factor for CYP3A7 
(12) , pointing to a role of this region in the general 
control of cytochrome P450 expression. The exact 
mechanism of up-regulation of CYP3A5 expression in the 
allelic variant described here remains to be 
determined although the presence of one of the 
mutations within the BTE, and the relevance of this 
element for the expression of other P450s indicates a 
possible mechanistic link. Using methylation 
interference f ootprinting, it has been shown that all 
guanine residues within the BTE, and other guanine 
residues in the vicinity, interacted with the 
transcriptional factor Spl (19) . Given that the 
mutation within the BTE (Spl) described herein alters 
an adenine residue to a guanine residue, then this 
could facilitate binding of transcription factors to 
the variant form of the Spl - 

Although there is considerable overlap in the range of 
CYP3A5 mRNA levels seen in the homozygous and 
heterozygous group, the distribution of metabolic 
ratios is clearly bimodal , as is the amount of CYP3A5 . 
We cannot exclude the presence of other polymorphisms 
that may affect the translation efficiency or protein 
stability of CYP3A5 . But given the better correlation 
between DNA polymorphism and protein level and the 
notorious liability of RNA, the simpler explanation is 
that differential RNA degradation or yield (due to 
differences in sample handling) has blurred the 
distinction between high and low expressers. Whatever 
the explanation for the discrepancy at the mRNA level, 
it does not in any way diminish the predictive value 
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of the DNA polymorphism described. 
There is, however, one individual whose genotype 
(heterozygous mutant) is not predictive of his 
metabolic phenotype (low expression) . The fact that 
CYP3A5 protein as well as mRNA levels were low in this 
outlier indicates that the explanation must be sought 
at the transcriptional level, e.g. in transcription 
factors controlling CYP3A5 expression. 

An AUG element in the 5 ' - untranslated region of the 
BTEB gene has been shown to be, at least in part, 
responsible for cell specific translational control of 
BTEB (20) . Mutations within this region were shown to 
affect BTEB translation. Therefore, whilst the outlier 
in our study has a high expression genotype for CYP3A5 
expression, this individual may have a "poor" 
expression phenotype for BTEB. Additionally, it is 
possible that a mechanism similar to that responsible 
for inducing CYP1A1 expression may also affect CYP3A5 
expression. In addition to the BTE, CYP1A1 expression 
is mediated by a xenobiotic responsive element (XRE) . 
In this case inducers enhance expression by binding to 
a cytosolic receptor (Ah receptor) which is 
translocated into the nucleus (possibly in association 
with an accessory protein coded for at the Arnt gene) , 
where it binds the XRE (17, 18) . Although variations 
in these and other transcription factors could further 
modulate CYP3A5 expression, this does not detract from 
the fact that the polymorphism described here seems to 
be the major determinant of CYP3A5 expression, at 
least in liver. 

Despite the relatively small number of samples 
available for analysis in the present study, strong 
associations have been found between the two linked 
polymorphisms on the one hand and both expression and 
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CYP3A5 mRNA, protein and activity levels in the liver 
on the other hand. The unravelling of a genetic 
mechanism for the polymorphic metabolism by CYP3A5 
will have important consequences in the field of 
pharmacogenetics. The ability to predict metabolism by 
genotyping will greatly facilitate disease association 
studies and may also help to explain adverse reactions 
or poor response to therapeutics which are metabolised 
by this cytochrome P450 isoform. It will also help in 
delineating which factors affecting CYP3A5 activity 
are genetic and which are environmental; for both 
further work will be required to fully understand the 
complex variation in expression observed with this 
enzyme . 

Putative promoter sequence analysis 

Materials and Methods : 

The sequence of the regulatory region of CYP3A5 was 
analysed with the ■ f indpatterns 1 program of the GCG 
sequence analysis package (GCG, Madison, Wisconsin) . 
This program finds specific DNA sequence motifs, 
patterns, and transcription binding sites, whose 
sequences are stored in the program, and are present 
in the sequence of interest. In the present analysis, 
at most one single mismatch or error per pattern is 
allowed in the sequence of interest, to detect if the 
two reported variations alter any known motifs or 
transcription binding sites. Results are identified 
in Figures 9 to 9d. 

The first, GCGTG to GCTTG variation 

removes binding sites for MBF-I_CS, MRE_CS2 , and 
CNBP-SRE. 

The second, CCACC to CCGCC variation 
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replaces binding sites for apoE-undef ined- site -3 , 

ApoE__Bl, APRT- CHOJUS, and APRT - human JUTS 

by GCF-consensus, APRT-mouse__US , GC-box_(l), 

DSE_(1) , Spl_CS4, Spl-hsp70_(l) , hsp70.2, Spl-IE- 

3.3, Spl-IE-4/5, I RE__ ( 1 ) , Spl-TPI_(4) 

does not affect the Yi-consensus pattern 

Both mutations affect transcription factor binding 
sites. 

Electrophoretic mobility shift assay (EMS A) 

An EMSA was carried out using the Spl NUSHIFT Kit from 
Geneka Biotechnology Inc. (Montreal, Canada) according 
to the manufactures instructions. Briefly, a 31-mer 
double-stranded oligonucleotide corresponding to the 
CYP3A5 5 1 -untranslated region containing the A_ 147 G 
polymorphism (5'-GGC AGC TGC AGCCCC GCC TCC TTC TCC 
AGC A-3 1 ) was end-labeled with 32 -P using T4 
polynucleotide kinase. 50,000 cpm (0.5 ng) 
oligonucleotide was incubated with 2 jig HeLa nuclear 
extract for 3 0 min at 16°C. Unlabeled mutant or 
wildtype (5-'GGC AGC TGC AGC CCC ACC TCC TTC TCC AGC 
A-3') oligo nucleotide was added in 50-fold or 100- 
fold excess as indicated. 1 or 2 p,l anti-Spl rabbit 
polyclonal antibody was pre -incubated with the nuclear 
extract at 4°C for 3 0 min as indicated. Nuclear 
extract, anti-Spl antibody and binding buffers were 
from Geneka Biotechnology Inc. Samples were separated 
on a 5% polyacrylamide (3 9:1) gel, in TGE buffer (25 
mM Tris, 190 mM glycine, 1 mM EDTA, pH 8.3) . The dried 
gel was exposed to X-ray film. 
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Analysis of the 5 ' -untranslated region of the CYP3A5 
gene indicated that the A_ 147 G polymorphism might 
create a binding site for the transcription factor 
Spl - An electrophoretic mobility shift assay (EMSA) 
was carried out to test this hypothesis. An oligo 
nucleotide containing the A_ 147 G polymorphism was used 
to assay for binding factors present in HeLa nuclear 
extracts. A band shift was observed (Figure 8, lane 2) 
which was competed away with 50- and 100 -fold excess 
respectively of unlabeled oligo nucleotide (Figure 8, 
lanes 3 and 4) , but not with wildtype oligo nucleotide 
(Figure 8, lanes 5 and 6). This clearly indicates the 
presence of a protein factor in HeLa nuclear extracts 
capable of binding to the A_ 147 G polymorphism region, 
but not to the wildtype region. Incubations in the 
presence of an antibody specific for the transcription 
factor Spl resulted in supershif ting of the A_ 147 G 
polymorphism oligo nucleotide (Figure 8, lanes 7 and 
8) , indicting that Spl is binding to the A_ 147 G 
polymorphism site. 

This change in binding affinity of transcription 
factor Spl to the 5 * -untranslated region of the CYP3A5 
gene might account for the increase in transcription 
from the A_ 147 G polymorphic promoter and in turn, might 
contribute to the increase in metabolic rates 
correlated with the A_ 147 G polymorphisms. 

Genotyping of the cytochrome expression 

A group of 3 00 healthy Caucasian volunteers was 
genotyped for variations T_ 475 > G and A„ 147 >G of the 
cytochrome P4 50 3A5 gene. 

Test rationale 

The first objective concerned allele/genotype 
frequencies . 
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Because the initial study included only 3 0 to 3 5 
different individuals, allelle/genotype frequencies 
could not be determined. Genotyping a group of 3 00 
subjects should permit determination of these 
frequencies and to check whether they are in agreement 
with the Hardy-Weinberg equilibrium. 

The second objective concerned the linkage of the two 
variations. In the initial study, all samples with the 
gene variations T_ 475 > G and A_ 147 > G (only 6 in total) 
were linked. To verify the suggested linkage, both of 
these CYP 3A5 polymorphisms were genotyped on a larger 
population . 

Materials and methods 

In order to minimize genotyping errors, genomic DNA 
samples from 300 healthy Caucasian volunteers were 
genotyped in a microtiterplate based format, which 
ensured a blind and completely independent duplicate 
analysis of each individual sample. 
A 1343 bp 5' flanking region of CYP3A5 was PCR- 
amplified from genomic DNA using primers 3A51/3A52 . 
PCR assays for both variations were performed using a 
1/100 dilution of the original 3A51/3A52 PCR product 
as template. Mismatch primers 3A5F2 and 3A5R1 were 
utilised for both assays. For the A_ 147 > G mutation the 
PCR product was digested with restriction enzyme Tax 
I, and for the T„ 475 > G mutation the PCR product was 
digested with restriction enzyme Alu I. After 
digestion the restriction fragments were separated by 
polyacrylamide gel electrophoresis and visualised by 
silver staining. The genotypes were determined based 
on the DNA fragment patterns by two independent 
observers . 
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Results 

1. Al lei le/geno type frequencies 

In the population of 300 individuals, 53 heterozygous 
subjects (18%) were carrying one copy of each of the 
variations, 246 subjects (82%) were homogenious for 
A_ 147 and T_ 475 , and one individual (0.3%) was carrying 
variations G_ 147 and G_ 475 on both allelles (homozygous) . 
These frequencies are in agreement with 3A5 expression 
found in previous studies (7,8,9) 

The allelle frequencies are in agreement with the 
Hardy -Weinberg equilibrium (Table 3). 

2. Linkage of variations T_ 475 > G and A_ 147 > G 

In all individuals, respectively variations T„ 475 and 
A_ 147 , and variations G_ 475 and G_ 147 , were equally 
represented in genotypes, indicating a strong linkage 
between both variations. Whether this linkage between 
both variations has some functional significance needs 
to be clarified further. As a consequence of the 
linkage, future genotyping will require only the 
analysis of one of the variations, whether it is the 
functional variant or not. 
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Table 1. Primers used for sequencing 5» flanking region 
of CYP3A5 from PCR product 3A51/3A52 (see text) . 



x rimer 


KJL lAllVUtr 




Ot/lJUClllC 1*7 J J 




P 


-12376 -1217 


GGAAGCAACCTACATGTCCATC 






-9786 -963 


AGTACAGGGAGCACAG 


3A5r>08 


R 


-9176 -932 


CACCTATTCATTCCTG 


3A5p02 


F 


-6986 -684 


TGCTATCACCACAGAC 


3A5p07 


R 


-6896 704 


GGTGATAGCAATAGAC 


3A5p03 


F 


-3646 -349 


AGGATGTGTAGGAGTC 


3A5p06 


R 


-4176 -434 


CCTC AC AC AG ATGT AACC 


3A5p04 


F 


-1766 -161 


TAAGAACTCAGGTTCC 


3A5p05 


R 


-1786 -194 


CAGAAACTGAAGTGGAG 


3A52 


R 


+1056 +87 


ATCGCCACTTGCCTTCTTC 



t F = 5» to 3 1 , R = 3' to 5 1 

* Primer locations are based on CYP3A5 sequence data 
of Jounaidi et al (11) 



Table 2. 



Position 


Variant Sequence 


Percentage 


-475 


T-K (T or G) heterozygote 


30.6% (11/36) 


-147 


A-R (A or G) heterozygote 


30.6% (11/36) 
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TABLE 3 



Hardy Weinberg Equilibrium test 



Observed values 







N 


freq 


genotype 


AA 


246 


0.820 


genotype 


AG 


53 


0.177 


genotype 


GG 


1 


0.003 


total 




300 


1 



1.112 = Chi-square (Pearson) 
0.292 = p-value 
1 = d.f. 



Allele A 
Allele G 



N 

545 
55 



freq 

0.908 

0.092 



Test: 

Population: 



CYP3A5 -45 A>G 
CON-JRF-1 



Expected values 
N freq 
247.5 0.825 
50.0 0.167 
2.5 0.008 
300 1 
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SEQUENCE LISTING 

<110> Paulussen, Aimee 
Armstrong, Martin 

<12 0> Genotyping Cytochrome Expression 

<130> 51639/001 

<140> PCT/GB99/04380 
<141> 1999-12-22 

<150> GB 9828619.8 
<151> 1998-12-23 

<160> 13 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 1 

ggaagcaacc tacatgtcca tc 
22 
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<210> 2 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 2 

agtacaggga gcacag 
16 

<210> 3 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 3 

cacctattca ttcctg 
16 

<210> 4 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : primer 
<400> 4 

tgctatcacc acagac 
16 

<210> 5 
<211> 16 
<212> DNA 

<213 > v Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 5 

ggtgatagca atagac 
16 

<210> 6 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 6 
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aggatgtgta ggagtc 
16 

<210> 7 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 7 

cctcacacag atgtaacc 
18 

<210> 8 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 8 

taagaactca ggttcc 
16 

<210> 9 
<211> 17 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 9 

cagaaactga agtggag 
17 

<210> 10 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

atcgccactt gccttcttc 
19 

<210> 11 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 11 
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gggtctgtct ggctgcgc 
18 

<210> 12 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 12 

ggggtctgtc tggctgagc 
19 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 13 

tttatgtgct ggagaaggac g 
21 



